A Survey on High-Dimensional Subspace Clustering
نویسندگان
چکیده
With the rapid development of science and technology, high-dimensional data have been widely used in various fields. Due to complex characteristics data, it is usually distributed union several low-dimensional subspaces. In past decades, subspace clustering (SC) methods studied as they can restore underlying perform fast with help self-expressiveness property. The SC aim construct an affinity matrix by self-representation coefficient then obtain results using spectral method. key how design a model that reveal real structure data. this survey, we focus on two decades present new classification criterion divide them into three categories based purpose clustering, i.e., low-rank sparse SC, local preserving kernel SC. We further subcategories according strategy constructing representation coefficient. addition, applications face recognition, motion segmentation, handwritten digits speech emotion recognition are introduced. Finally, discussed interesting meaningful future research directions.
منابع مشابه
A Preview on Subspace Clustering of High Dimensional Data
When clustering high dimensional data, traditional clustering methods are found to be lacking since they consider all of the dimensions of the dataset in discovering clusters whereas only some of the dimensions are relevant. This may give rise to subspaces within the dataset where clusters may be found. Using feature selection, we can remove irrelevant and redundant dimensions by analyzing the ...
متن کاملSubspace Clustering of High Dimensional Data
Clustering suffers from the curse of dimensionality, and similarity functions that use all input features with equal relevance may not be effective. We introduce an algorithm that discovers clusters in subspaces spanned by different combinations of dimensions via local weightings of features. This approach avoids the risk of loss of information encountered in global dimensionality reduction tec...
متن کاملSubspace Clustering for High Dimensional Categorical Data
A fundamental operation in data mining is to partition a given dataset into clusters such that objects in the same cluster are more similar to each other than objects in different clusters according to some defined criteria [2]. These criteria are usually defined in the form of some distance, and similarity is hence defined as follows, the smaller the distance is, the more similar the objects a...
متن کاملSoft Subspace Clustering for High-Dimensional Data
High dimensional data is a phenomenon in real-world data mining applications. Text data is a typical example. In text mining, a text document is viewed as a vector of terms whose dimension is equal to the total number of unique terms in a data set, which is usually in thousands. High dimensional data occurs in business as well. In retails, for example, to effectively manage supplier relationshi...
متن کاملA Survey on Soft Subspace Clustering
Subspace clustering (SC) is a promising technology involving clusters that are identified based on their association with subspaces in high-dimensional spaces. SC can be classified into hard subspace clustering (HSC) and soft subspace clustering (SSC). While HSC algorithms have been studied extensively and are well accepted by the scientific community, SSC algorithms are relatively new. However...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Mathematics
سال: 2023
ISSN: ['2227-7390']
DOI: https://doi.org/10.3390/math11020436